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DETAILED ACTION 

Continued Examination Under 37 CFR 1.114 

1 . A request for continued examination under 37 CFR 1.114, including the fee set 
forth in 37 CFR 1 .17(e), was filed in this application after final rejection. Since this 
application is eligible for continued examination under 37 CFR 1.114, and the fee set 
forth in 37 CFR 1 .17(e) has been timely paid, the finality of the previous Office action 
has been withdrawn pursuant to 37 CFR 1.114. Applicant's submission filed on 

1 1/20/08 has been entered. 

Response to Arguments 

2. Applicant's arguments filed 11/20/08, regarding claims 1 - 3, 6, 7, 10-14, 16 — 
21 , 23, 24, and 26-31, and 36 - 40 have been fully considered but they are not 
persuasive. 

Applicant argues that Gillis does not teach a weighting factor relating to an 
importance, based on characteristics of the document of said term (Amendment, pages 
15-17). 

The examiner disagrees, since Gillis disclose "Selected terms within a multiterm 
query may be weighted if desired to reflect their importance to a user. The weightings 
of word relationships are entirely natural domain specific (col.42, line 62 - col.43, 
line 1; col.23, lines 35 -40). 
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Claim Rejections - 35 USC § 101 
3. 35 U.S.C. 101 reads as follows: 

Whoever invents or discovers any new and useful process, machine, manufacture, or composition of 
matter, or any new and useful improvement thereof, may obtain a patent therefor, subject to the 
conditions and requirements of this title. 

Claims 1, 2, 3, 6, 7, 10 - 14, 16-21, 34 and 36-40 are rejected under 35 
U.S.C. 101 as not falling within one of the four statutory categories of invention. 
Supreme Court precedent 1 and recent Federal Circuit decisions 2 indicate that a 
statutory "process" under 35 U.S.C. 101 must (1) be tied to another statutory category 
(such as a particular apparatus), or (2) transform underlying subject matter (such as an 
article or material) to a different state or thing. While the instant claim(s) recite a series 
of steps or acts to be performed, the claim(s) neither transform underlying subject 
matter nor positively tie to another statutory category that accomplishes the claimed 
method steps, and therefore do not qualify as a statutory process. Claims 1 , 2, 3, 6, 7, 
1 0 - 1 4, 1 6 - 21 , 34 and 36 - 40 do not identify the apparatus that accomplishes the 
method steps like "processing the semantic vector by a digital computer" described in 
page 12, paragraph 36, last three lines of the specification. Thus claims 1, 2, 3, 6, 7, 10 
- 14, 16 - 21 , 34 and 36 - 40 do not define a statutory process. 



1 Diamond v. Diehr, 450 U.S. 175, 184 (1981); Parker v. Flook, 437 U.S. 584, 588 n.9 (1978); Gottschalk v. 
Benson, 409 U.S. 63, 70 (1972); Cochrane v. Deener, 94 U.S. 780, 787-88 (1876). 

2 In re Bilski, 88 USPQ2d 1385 (Fed. Cir. 2008). 
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Claim Rejections - 35 USC § 102 

4. The text of those sections of Title 35, U.S. Code not included in this action can 
be found in a prior Office action. 

5. Claims 1 , 2, 3, 6, 7, 1 0 - 1 4, 1 6 - 21 , 23, 24, 26 - 31 , and 36 - 40 are rejected 
under 35 U.S.C. 102(a) as being anticipated by Gillis (US Patent 6,523,026). 

As per claims 1 , Gillis teaches comparing semantic content of two or more 
documents, comprising: 

accessing two or more documents ("source and target domains"); performing a 
linguistic analysis on each document ("computing a set of vectors"; col. 10, lines 9-17); 
col.11, lines 36-40); 

defining a semantic vector for each document based on the linguistic analysis, 
said semantic vector having multiple components, wherein each component of said 
semantic vector has at least: a term included in the document or a synonym of said 
term; a weighting factor relating to an importance ("Selected terms within a multiterm 
query may be weighted, if desired, to reflect their importance to the user"), based on 
characteristics of the document of said term ("The weightings of word relationships 
are entirely natural domain specific"); and a frequency value relating to a number of 
occurrences of said term ("computing a set of term vectors"; col.1 1 , lines 36, and 37; 
col. 10, lines 18-20; col.42, line 67- col.43, line 1; col.23, lines 35 -40). 
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As per claim 2, Gillis further discloses that the linguistic analysis comprises 
sentence analysis ("sentence in the individual documents"; col.43, lines 43-46). 

As per claim 3, Gillis further discloses that the sentence analysis comprises a 
syntactic analysis ("preferred stop list word include in the vectorization") and a semantic 
analysis ("semantic similarity"; col. 39, lines 14 - 20; col. 35, lines 4 - 6). 

As per claim 6, Gillis further discloses that each component of the semantic 
vector for at least one of the documents comprises multiple dimensions ("n dimensional 
space"; col. 39, line 63-col.40, line 1). 

As per claim 7, Gillis further discloses that each component of the semantic 
vector for at least one of the documents further comprises a subordinate concept value 
("cable" is the subordinate concept of term "telecommunications"; col. 51, lines 30 - 35). 

As per claim 1 0, Gillis further discloses that some of the components of the 
semantic vector have for at least one of the documents {main term - subordinate term 
pairs} as their first value ("cable" and "telecommunications" are related term pairs, 
wherein cable is the subordinate term of telecommunications; col. 51, lines 30 - 35). 



Application/Control Number: 10/766,308 Page 6 

Art Unit: 2626 

As per claim 1 1 , Gillis further discloses that the semantic vector comprises a 
multi-dimensional vector defined by the content of a semantic net ("n dimensional 
semantic space"; col. 39, line 63 - col.40, line 1). 

As per claim 1 2, Gillis further discloses that the content of the semantic net is 
augmented by relative weights, strengths, or frequencies of occurrence of the features 
within the semantic net ("frequency related weightings to term in the computation of 
summary vectors"; col.41 , lines 40 - 46). 

As per claim 14, 23, 24, 26, and 27, Gillis teaches comparing two or more 
documents, by: 

linguistically analyzing two or more documents to identify at least one term group 
in each document, each tern group comprising a main term and at least one subordinate 
term semantically related to the main term ("a small subset of terms (or groups of terms 
such as phrases) is chosen from the source domain... computing a set of vectors"; 
col. 10, lines 9-22); col.11, lines 36-40); 

generating a semantic vector associated with each document, the semantic 
vector comprising a plurality of components, each component including; a term group in 
the document; a frequency value relating to a number of occurrences of the term group; 
and a weighting factor relating to an importance ("Selected terms within a multiterm 
query may be weighted, if desired, to reflect their importance to the user"), based on 
characteristics of the document of said term ("The weightings of word relationships 
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are entirely natural domain specific") of at least part of the term group ("computing a 
set of term vectors... Vector of terms that occurred less frequently in the training corpus 
are weighted more heavily in the calculation of summary vectors of search domain 
records"; col.11, lines 36, and 37; col. 10, lines 18-20; col.42, line 67- col.43, line 1; 
col .41 , lines 43 - 47; col.23, lines 35 -40); and 

comparing the semantic vectors using a defined metric ("summary vectors to be 
compared"; col. 39, line 19, and 20; col.42, lines 2, and 3); 

wherein said metric measures the semantic distance between two documents as 
a function of at least the frequency values included in the semantic vectors for the two 
documents ("semantically distant are individually represented at least 50 times"; col.48, 
lines 48 - 55; col.51 , lines 30 - 35; col.41 , lines 43 - 47). 

As per claim 16, Gillis further discloses that the main term includes synonyms of 
the main term (col.1 1 , line 8). 

As per claims 17, 28, Gillis further discloses that one or more of said two or more 
documents are located using an autonomous software or 'bot program ("software 
programs"; col. 10, lines 9-17; col. 25, lines 57-67). 

As per claims 18, and 29, Gillis further discloses automatically analyzes each 
document in a defined domain (source and target domains) or network by executing a 
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series of rules and assigning an overall score to the document ("average of component 
values"; col. 10, lines 9- 17; col .41, line 66-col.42, line 25). 

As per claim 1 9, Gillis further discloses that all documents with a score above a 
defined threshold are linguistically analyzed ("generate term vectors and accept only 
records that match all the categories beyond some minimum threshold"; col.46, line 65 
-col.47, line 11). 

As per claims 20, and 30, Gillis further discloses that the semantic vector is a 
quantification of the semantic content of each document ("semantic vectors"; col. 39, 
lines 14-20; col . 1 , lines 15-20). 

As per claim 21 , Gillis further discloses that each component has multiple 
dimensions ("n dimensional semantic space"; col. 39, line 63 - col.40, line 1). 

As per claim 31 , Gillis further discloses that the output of said defined algorithm 
is a measure of at least one of semantic distance, semantic similarity, semantic 
dissimilarity, degree of patentable novelty and degree of anticipation ("semantic 
similarity"; col.4, lines 1 - 3). 
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As per claim 36, Gillis further discloses that said term comprises at least one of a 
word or a phrase ("a small subset of terms (or groups of terms such as phrases) is 
chosen from the source domain"; col. 10, lines 19-22). 

As per claim 37, Gillis further discloses that comparing the semantic vectors 
based on a defined algorithm (col .42, line 2). 

As per claim 1 3, Gillis further discloses that the output of said defined algorithm 
is a measure of at least one of semantic distance, semantic similarity, semantic 
dissimilarity, degree of patentable novelty and degree of anticipation ("semantic 
similarity"; col.4, lines 1 -3). 

As per claim 38, Gillis further discloses that the at least one subordinate term 
includes synonyms of one of the subordinate terms (col.1 1 , line 8). 

As per claim 39, Gillis further discloses that one or more of the at least one 
subordinate term or the main term comprises a phrase (col.1 0, lines 1 9 - 22). 

As per claim 40, Gillis further discloses that the weighting factor comprises a 
plurality of different weighting factors and each of the different weighting factors relates 
to the importance of the main term or a subordinate term in the term group ("Vector of 
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terms that occurred less frequently in the training corpus are weighted more heavily in 
the calculation of summary vectors of search domain records"; col.41 , lines 43 - 47). 



Allowable Subject Matter 

6. Claims 33, and 35 are allowed over the prior art. The following is an examiner's 
statement of reasons for allowance: 

As to claim 33, and 35, Gillis does not teach or suggest that the defined metric is 
one of: Sqrt ( f 1 2 + f 2 2 + f 3 2 + f 4 2 + + f ( N - 1 ) 2 f N 2 ) n * 1 00 .wherein f is a 
difference in frequency of a common term between two documents and n is the number 
of terms those documents have in common; or Sqrt(sum((w-Delta)A2*w- 
Avg))/(Log(n)A3*1000), wherein w-Delta is the difference in weight between two 
common terms, w-Avg is the average weight between two common terms, and n is the 
number of common terms, between two documents. 



Conclusion 

7. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to LEONARD SAINT CYR whose telephone number is 
(571) 272-4247. The examiner can normally be reached on Mon- Friday. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Richemond Dorvil can be reached on (571) 272-7602. The fax phone 
number for the organization where this application or proceeding is assigned is (571)- 
273-8300. 
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Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 
LS 

01/16/08 

/Richemond Dorvil/ 

Supervisory Patent Examiner, Art Unit 2626 



